Evolution of surname distribution under gender-equality measurements 
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We consider a model for the evolution of the surnames distribution under a gender-equality mea- 
surement presently discussed in the Spanish parliament (the children take the surname of the father 
or the mother according to alphabetical order). We quantify how this would bias the alphabetical 
distribution of surnames, and analyze its effect on the present distribution of the surnames in Spain. 

O INTRODUCTION 

In Spain, as in many other countries, children usually inherit the surname of the father. As a consequence, the 



o 



surname of the mother is lost in the children's generation Nowadays, in Spain, parents can agree upon whether 
it is the mother's or the father's surname that is given to their children, but if parents do not reach an agreement, 
it will be the father's surname the one inherited by the children. Due to gender-equality issues, a new law is under 
study which would imply that, if parents do not reach an agreement, or if no wish is expressed, the surname inherited 
f-H , by the children will be selected according to the alphabetical order of the parent's two surnames. 

People have immediately realized that this implies a bias on the surnames favoring those beginning by the first letters 
in the alphabet (A,B,. . . ) and could mean the disappearance of surnames beginning by the last letters (. . . ,Y,Z). In 
q ' this short note, we quantify the effect of this bias on the present distribution of surnames in Spain. 
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O ' MODEL 
CO 

As a first order model that captures the essence of the process of surname inheritance we propose the following: 

(i) Initially, a population of 2N individuals (N male and N female) is considered. Each individual has a surname 
i_i , chosen according to some prescribed distribution. 

(ii) Males and females reproduce in random pairs in such a way that, on average, the total population remains 
constant. 

■ (in) With probability a it is assumed that parents reach an agreement, so that the surnames of the children arc 
\ chosen at random between those of the parents (it is not important for the results in which proportion they prefer 

r**»" ■ the father's or the mother's surname). With probability 1 — a, parents do not reach or do not express an agreement, 
and the children adopt the surname by the alphabetical order rule. 

We measure time t in average reproductions per person, or generations. In a generation, parents are replaced by their 
children in the population. 

This is a minimal model and does not consider many realistic issues: new surnames brought in by immigration, 
geographical distribution of surnames, etc. but those are expected to be second order effects with little impact in the 
overall trend. 

Let us define p(n, t) as the proportion of individuals (both males and females) with surname in the alphabetical 

■ position n = 1, ... , M, being M the total number of surnames. It evolves according to: 



dp(n,t) 

— -Q t — = (l-a)p(n,t) 



M n-1 

E KM)-$>(M) 

_k=n+l fc=l 



= (1 - a)p(n, t) [1 - P(n, t) - P(n - 1, t)} , (1) 



where P{n, t) = X)fc=i p{^i t) ^ s the cumulative distribution. It follows that 

dP(n,t) 



dt 

whose solution is: 



= (l-a)P(n,t)[l-P(n,t)], (2) 



P(n, 0)e ( - 1 ~ a ' )t 

P( "' f) = l + P(n,0)(e(i-»)'-l) • (3) 
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The distribution of surnames at time t is then p(l, t) — P(l,t) and p(n, t) — P(n, t) — P(n — 1, t) if n > 1. Approxi- 

P(n, 
dt 



mating the difference by a derivative p(n,t) ~ aF l" :t - ) , we obtain: 



p(n,t) = J- 4 

[l + P(n,0)(e( 1 - Q ) t - 1)] 

Eq. © shows that the distribution of surnames approaches a Kronecker-delta at n — 1 (P(n,t) = 1, Vn) exponentially 
fast with a characteristic time 1/ (1 — a). Assuming, for instance, that couples reach and agreement about the children's 
name and express it in 50% of the cases (a = 1/2), we find from Eq. ((4]) that the frequency of a surname around the 
end of the alphabetical table would be decreased by a factor 10 in around 4.6 generations^ 115 years). 



EVOLUTION OF CURRENT DISTRIBUTION 



We have applied the above results to the actual distribution of Spanish surnames. Besides the analytical result 
of Eq,(j4]), we have performed a numerical simulation of the model in which iV = 10 7 couples have probabilities 
(0.05,0.2,0.5,0.2,0.05) of having (0,1,2,3,4) children (average value is 2). The probability of parents reaching an 
agreement is set at a = 0.5. Independently on whether an agreement has been reached or not, the rule applied 
to the first-born child is used for all children. We have used as the initial condition p(n, 0) the distribution of the 
M = 100 most common surnames in Spain, as published by the INE after ordering them by alphabetical order. 
The evolution after n = 4 and n = 10 generations is plotted in the figure. The agreement between the simulation and 
the analytical result is excellent. 



CONCLUSIONS 



In our minimal model for surname transmission, we prove that the adoption of the alphabetical rule leads to an 
exponential decrease for the surnames in the last positions in the alphabetical order, with a characteristic decay time 
of 1/(1 — a) generations, begin a the fraction of parents that reach an agreement, This quantifies the decrease in the 
frequency of those surnames. 
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Fig. 1. -Evolution of the distribution of surnames after n = 4 (left) and n = 10 generations, taking as initial condition p(n, 0) 
the actual distribution of the M — 100 most common surnames in Spain. For n = 10 we have used a logarithmic scale for a 
better viewing of the data. The dots are the result of the numerical simulation of a more detailed model that includes the 
basic premises used in the derivation of the analytical expression. 
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[1] In Spain, though, the mother's surname is kept as a second surname. It is consequently totally lost in the grand-children's 
generation. 

[2] INE stands for "Instituto Nacional de Estadfstica" . The data are in the webpage www.ine.es. Similar data are available for 
other countries. Our simulation results only consider those 100 surnames for which data are publicly available. 



